Small Gaps between Primes Exist 
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Abstract. In the recent preprint [3], Goldston, Pintz, and Yildirim established, among other things, 

(0) liminf Pn 1 +1 ~ P " =0, 

n-»oc logp„ 

with p n the nth prime. In the present article, which is essentially self-contained, we shall develop a simplified 
account of the method used in [3]. While [3] also includes quantitative versions of (0), we are concerned here 
solely with proving the qualitative (0), which still exhibits all the essentials of the method. We also show here 
that an improvement of the Bombieri- Vinogradov prime number theorem would give rise infinitely often 
to bounded differences between consecutive primes. We include a short expository last section. Detailed 
discussions of quantitative results and a historical review will appear in the publication version of [3] and 
its continuations. 

1. Basic Lemma 

In this section we shall prove an asymptotic formula relevant to Selberg's sieve, which is to be modified 
so as to involve primes in the next section. The two asymptotic formulas thus obtained will be combined 
in a simple weighted sieve setting, and give rise to (0) in the third section. 

Let N be a parameter increasing monotonically to infinity. There are four other basic parameters 
H, R,k,£ in our discussion. We impose the following conditions to them: 

(1.1) # <logY<logi?<logAr, 
and 

(1.2) integers k,£ > are arbitrary but bounded. 

To prove a quantitative assertion superseding (0), we need to regard k,£ as functions of N; but for our 
present purpose the above is sufficient. All implicit constants in the sequel are possibly dependent on k,£ 
at most; and besides, the symbol c stands for a positive constant with the same dependency, whose value 
may differ at each occurrence. 

Let 

(1.3) n = {h 1 ,h 2 ,...,h k }C[l,H]nZ, 
with hi 7^ hj for i ^ j. Let us put, for a prime p, 

(1.4) Fl(p) = {different residue classes among — h(modp), h £ Ti] 
and write n <= f2(p) instead of n (modp) € f2(p). We call H admissible if 

(1.5) \n(p)\<p for all p, 
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and assume this unless otherwise stated. We extend ft multiplicatively, so that n E Q(d) with square- free d 
if and only if n G f2(p) for all p\d, which is equivalent to 

(1.6) d\P(n;H), P{n;H) = {n + h 1 ){n + h 2 ) ■ ■ ■ (n + h k ). 

Also, we put, with fi the Mobius function, 

r if d > R, 

(L7) Md;a) = { }_^ d){logR/d) a - l{d < R ^ 



and 



(1.8) A R (n;H,a) = ]T X R (d;a) = 1 £ //(d) (log i?/d) a . 



a! 

riGO(d) d|P(n;«) 



With this, we shall consider the evaluation of 

(1.9) J2 A fl (n;H,fc + £) 2 . 

N<n<2N 

Expanding out the square, we have 

(1.10) J2x R (duk + e)\ R {d 2 ;k + e) L 

di,d 2 N<n<2N 

nen(di),nen(d 2 ) 

The condition on n is equivalent to n e , c?2]), with [di,(i 2 ] the least common multiple of the two 

integers; and 

(1.11) Yl ^R(^K,k + £) 2 = NT + o(fe\n(d)\\\ R {d ] k + t)\f\, 

N<n<2N \ d ) 

in which 

(1.12) T = V fc^A R (d,; + £)A fl (d 2 ; fc + £). 

We have |0(d)| < Tfc(d) with the divisor function Tfc. Thus 

(1.13) A fl (n;H,fc + ^) 2 = iVr + 0(i? 2 (logi?) c ). 

Af<n<2AT 

On noting that for a > 1 

/u(d) f / i?\ s ds 



(1.14) X R (d;a) 



2ni 7(i) V d J s a+1 ' 



with (a) the vertical line in the complex plane passing through a, we have 

1 /■ f R Sl+S2 
(2tt«) 2 7(i)7(i) (siS 2 ) fe+ ^ +1 
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where 



(Lie) F(si , S2!n) = E ^ lWA) JMi|| 



-n >- 



l 2 

|fi(p)| / 1 1 



pS2 pSl+S 2 



in the region of absolute convergence. 
Since = k ior p > H, we put 

(1.17) G(si,s 2 ;0) = J F(s 1 ,s 2 ;0) 



/ C(5i + i)C(g 2 + i) \ fc 

I c(si + S2 + i) y ' 



with £ the Riemann zeta-function. This is regular and bounded for Resi,Res2 > — c. In particular, we 
have the singular series 

(1.18) 6(H) = G(0, 0; fl) = J] (l - ™) (l - \) ~" , 

which does not vanish because of (1.5). We have the bound 

(1.19) G(*i,s 2 ;fi) <exp(c(log7V)^ 2<T logloglogiV), 

with min(Re s\, Re s 2 , 0) = cr > — c, as can be seen via the Euler product expansion of the right side of 
(1.17). In fact, the part corresponding to those p > H is uniformly bounded in the indicated region since 
|f2(p)| = k. For k 2 < p < H, the logarithm of each p-factor is estimated to be <C H~ 2a J2 p <h P _1 ! ano - * ne 
treatment of those p < k 2 is trivial. Note that the restrictions (1.1) and (1.2) are relevant here. 

Now, we have in place of (1.15) 



(1-20) ' r =TT-vA I G( Sl ,s 2 ;Sl)( „ aa *+** + 1 2 u ) 

(2ttz) 2 J(i)J(i) VC(si + 1)C(S2 + 1)/ (sis 2 ) k+i+1 



k R Sl+S2 



dsid,S2- 



Let us put U = exp(v / Tog~/V), and shift the si and s 2 -contours to the vertical lines (log?/) -1 + it and to 
(2logU))^ 1 + it, t e R, respectively. We truncate them to \t\ < U and \t\ < U/2, and denote the results by 
L\ and L 2 , respectively. On noting (1.1) and (1.19), we have readily that 

(1-21) T =TT^I I G (^ S2 ^( J iS lVr( + 1 \X( R \ + Z + i ds ^ ds2 

(2TTI) 2 J L J Ll VC(S1 + 1)C(«2 + 1)/ {siS 2 ) k+i+1 

+ O (exp(-cVlogiV)) • 

We then shift the si-contour to L 3 : — (logt/) -1 + it, \t\ < U; necessary facts about the functions £ and 
1/C can be found in [4, p. 53] (or p. 60 in the Second Edition). We encounter singularities at s\ = and 
Si = — s 2 , which are poles of orders £+1 and k, respectively. We have 

(1.22) T=-L.[ (rcs+ Res 1 ds 2 + O f exp(— c-v/log N)) , 

27r * Jl 2 l S l=° *l=-»2 J v / 

in which we have used (1.19). 

We rewrite the residue, and have 

(1-23) Res= -L[ G( Sl , S2 ;n) ( U*±±J*±V ) glgL^ , 

Sl =- S2 2iri J C(S2) VC(si + 1)C(S2 + 1)/ (sis 2 ) k + e + 1 
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with the circle C(s 2 ) : \si+s 2 \ = (log N)^ 1 . By (1.19), we have G(si, s 2 ; fi) < (loglogiV) c ; andi? Sl + S2 < 1, 
C(*i + s 2 + 1) < log TV. Also, since |s 2 | < < |s 2 |, we have (siC(«i + I)) -1 < (M + 1) _1 log(|s 2 | + 2), 
loc.cit. Thus 



/ , ,,,,-„ / lo g(M +2)\ 2fe 2l _ 2 



(1.24) Res «(logjV) fc ~ i (loglogiV) c ^ , z y |s 2 

«l=-«2 \ |S 2 | + 1 / 

Inserting this into (1-22), we get 

(1.25) T=±JjRe % \ ds2 + 0( { lo g N)^). 
To evaluate the last integral, we put 

M Ofi\ <7/ \ rV ^/(si +S 2 )C(S1 +S 2 + l)\ fe 

(1-26) s 2 = G si, s 2 ; — - — — — — , 

V s lC( s l + 1)S 2 C( S 2 + 1) / 

which is regular in a neighborhood of the point (0, 0). Then we have 
(1-27) Res^fAV (^4^1. 

We insert this into (1-25) and shift the contour to L4 : — (21ogC/) _1 +it, \t\ < U/2. We see the new integral 
is O (exp (— c^/log N) ) ; the necessary bound for the integrand could be obtained in much the same way as 
(1.24). Thus 



(1.28) T = Res Res +0((logN) k+t ) 

s 2 =0 si=0 



S1+S2 



> / / ( z ''-y + i, ^ + o(co ey) «), 

™) 7c 2 id ( s i + s 2 ) fc (sis 2 f +1 



(2m 

where Ci, C 2 are the circles |si| = p, \s 2 \ — 2p, with a small p > 0. We write si = s, s 2 — s£. Then the 
double integral is equal to 

(i - 29) » i 3 y Cl k+w^^' 



(L30) Jft + K)! (l0S R)k+n Ic 3 {A W^^ + ° ((l0g Ar)fc+2 '" 1 (1 ° S l0g ^ 



where C 3 is the circle |£| = 2. This is equal to 

\2l 

\S T J- 

where we have used (1.19). 

Hence, we have obtained our basic implement: 

Lemma 1. Provided (1.1), (1.2), and i? < AT 1 / 2 /(log 7V) C hold with a sufficiently large C > depending 
only on k and I, we have 



(1.31) £ A*(n;H,fc + ^ = (f) JV(Iogi2)*+ M + 0(JV(log A^+^Ooglog 

N<n<2N ^ /• \ / 



A) c ). 
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2. Twist with Primes 

Next, let us put 

(2 1) win) — { l°S n if ^ is a prime, 

[ otherwise; 

and consider the evaluation of the sum 

(2.2) ™(n + h)A R {n-H,k + £) 2 , 

N<n<2N 

with an arbitrary positive integer h < H. We observe that by (1.6) this is equal to 



(2.3) ™{n + h)A R (n;H\{h},k + . 



N<n<2N 



provided R < N; in fact, if w(n + h) ^ and h e H, then the factor n + h of P(n;H) is irrelevant in 
computing A#(n; H;k + £). 

We shall work on the assumption: There exists an absolute constant < 9 < 1 such that we have, for 
any fixed A > 0, 

(2.4) V max max \d* (y ; a, q) - y/ip(q)\ <^ x /(log x) A , $*(y;a,q) = V w(n), 

q<x» ~ (o,9)=l y<n<2y 

n=a mod g 

with the implicit constant depending only on A. We assume that 

(2.5) R< N e l 2 /{\ogN) A . 

In particular, (2.3) implies that we may assume also that h ^ H. 

With this, expanding out the square in (2.2), we see that the sum is equal to 

(2.6) Yl Mdi;k + £)X R (d 2 ;k + e) ]T S((b + h, [d u d 2 }))#* (N; b + h, [d u da]), 

d ± ,d 2 6eO([di,d 2 ]) 

where S(x) is the unit measure placed at x = 1, because fl* (N,b + h, [di,d 2 ]) = if b + h and [di,^] are 
not coprimc. Then, by (2.4), this is equal to 

(2.7) NT* + 0(N/ (log N) A / 3 ), 
with 

(2.8) ^=E W; *trfff ;fc + /) £ W + M*,*]))- 

The error term in (2.7) might require an explanation: We divide the sum in (2.6) into two parts according 
as |f2([c?i, d 2 })\ = Tk([di, d 2 ]) < (logN) A / 2 and otherwise. To the first part we apply (2.4), while the second 
part is 



T k ([d u d 2 }) \n([d u d 2 ])\ 

d x ,d 2 <R 

provided A is sufficiently large. 



(2.9) <<iV(logi?)^>logiV Y, ^^'T^^ ^g^ 3 - 
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It remains for us to evaluate T*. The inner sum of (2.8) is equal to 

(2.10) n [ s((b+h,p))) = n (\n+( P )\-i). 

p|[di,d 2 ] \ben( P ) J p\[di,d2] 

Here f2 + corresponds to the set 7i + = Ti U {h}. In fact, 8((b + h,p)) vanishes if and only if — h € Q(p); 
and the latter is equivalent to Q(p) = Q + (p). Note that the analogue of (1.5) for 0+ could be violated. 
Nevertheless, we have, as before, 

r -W?IJ m n (• - ^r 1 (£ + f- J^)) 

For p > H, we have |f2 + (p)| = fc + 1, since h ^H. Thus, we consider the function 



(2.12) [](•••) 



C(*l + 1)C(«2 + i) 
C(Si+S 2 + l) 



fc 



as in (1.17). If H + is admissible, the singular series is &(H + ) and the argument and computation of residues 
is analogous to above. Thus we find that provided h <^TL 

(2.13) T* = ^^(^)(Iogil)^« + O((IogJ\0 fc+ - M - 1 (IoglogJ\0 c ). 

On the other hand, if H + is not admissible or &(TL + ) = 0, then the Eulcr product in (2.11) vanishes at 
either si = or s 2 = to the order equal to the number of primes such that |f2 + (p)| = p. However, since 
we have then p < k + 1 , the necessary change to the above reasoning results only in the lack of the main 
term in (2.13) and the error term remains to be the same or actually smaller. 

Finally, if h £ H, the above evaluation applies with the translation fc ^ fc— 1, I t + 1 to (2.13) 
because of (2.3). 

From this, we obtain 

Lemma 2. Provided (1.1), (1-2), and (2.4) hold, we have, for R < N e / 2 /(log N) c with a sufficiently large 
C > depending only on k and I, 

(2.14) ^ w(n + h)K R (n;H,k + l) 2 

N<n<2N 

6 ( ( ^ + U ff } (^(logi?)^ + O^QogiVf+^GoglogiVr) ifhtH, 
_©(W) (2(1 + 1)\ N ^ R)k+ 2i+i + 0(7V(log jv)*+ M (log log N) c ) if hen. 



(k + 2l+l)\\ £+1 



3. Proof of (0) 

We are now ready to prove (0). We shall evaluate the expression 

(3.1) 2 \^2Mn + h)-log3N \ A R (n;H, k + if 

HC[1,H] N<n<2N \h<H I 
\H\=k 
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If this turns out to be positive, then there exists an integer n <G (N, 2N) such that 

(3.2) ^w{n + h)- log 3iV > 0. 

h<H 

That is, there exists a subinterval of length H in (N, 2N + H] which contains two primes; hence 

(3.3) min (p r +i — Pr) < H. 

N<p T <2N+H 

Here, we need to quote, from [2], 

(3.4) ]T &(H) = {l + o(l))H\ 

HC[1,H] 
\H\=k 

as H tends to infinity. With this and Lemma 1, we see that (3.1) is asymptotically equal to 



(3.5) E { E + E }™{n + h)k R {n;H,k + tf 



HC[1,H] N<n<2N h<H h<H 

\n\=k K WH hen 



2t 



{k + 2£y\£ 



NH k '(log TV) (log R) k+2e , 



with an admissible error of the size of o (iV_ff fc (logiV) fc+2£+1 ). By Lemma 2 and (3.4) with an appropriate 
replacement of H, this is asymptotically equal, in the same sense, to 

'I^Tw( 2 /)™'< 1 ^ JV >< loe^! >' +2, 

Hence (3.1) is positive, provided 

(3.7) H>(l + e 2 -^l. e -)lo g N, 

with any fixed e > 0. Therefore, with I = [Vk], say, we obtain 

(3.8) lim inf - Pn < max {0, 1-26}. 

n^oo logp n 

In particular, the Bombieri- Vinogradov Prime Number Theorem [1, Theoreme 17] gives rise to the assertion 
(0). This ends the proof. 

Finally, we shall exhibit two conjectural assertions: 

1) If we have 6 > ^, then there will be infinitely many n such that p n +i — Pn < c(9) with an absolute 
constant c(8). In fact, we would be able to suppose H > c(9) in the above as far as (3.6), and the assertion 
follows immediately 

2) If we have 9 > 22 ) then we will be able to assert the simultaneous appearance of primes in admissible 
7-tuples. For instance, at least two of the seven integers {n, n + 2, n + 6, n + 8, n + 12, n + 18, n + 20} will 



8 



be primes for infinitely many n. In particular, p n +i — Pn < 20 infinitely often. To prove this, let Ti be such 
a tuple, and consider, in place of (3.1), 

(3.9) J2 ( J2 ru{n + h) -\og2,N J A fi (n;H,8) 2 . 

N<n<2N \heT-l ) 

Lemmas 1 and 2, with fc = 7, £ = 1, imply that under the present assumption on 9 this is asymptotically 
equal to 

(3.10) |s(W) (^logi?-logA^ 7V(logi?) 9 >0, 

provided i? = iV 10 / 21+ ^ with sufficiently large AT and a small £ > 0. Hence the assertion follows. 

4. Expository 

The principal idea in [3] is the amazing effect induced by the introduction of the parameter £ in (1.9). 
The sieve weight n(d) (log m/d) k+e , d\m, applied to the polynomial m = P(n;Tt) detects n with which 
P(n;Tt) has k + £ distinct prime factors at most, implying that the integers n + hj, j < k, are mostly 
primes, provided k is large compared with I. By a standard method in this field, we approximate these sieve 
weights by \ii(d; k + £), and consider the Selberg sieve situation (1.9), with the parameters I and R at our 
disposal. An asymptotic formula for the sum (1.9) is given in (1.31). Then, to detect at least two primes 
among n + hj, j < k, a usual weighted sieve situation is considered at (3.1); for this the other asymptotic 
formula (2.14) is required. The upshot is condensed in (3.6) and (3.7). The proof of (0) requires that both 
k and I can be taken appropriately and the Bombieri- Vinogradov prime number theorem is available. 

Rendering the above more technically, the reason for success lies not only in the introduction of the 
parameter I but also in the trivial fact (2.3), which brings forth the translation I i— » I + 1 as remarked in 
the proof of Lemma 2. This introduces the factor ( 2 £ £ _ t f 1 1 ^) on the right of (2.14). One should note that 

C^e+P) I Ce) = 2 (f+i)^ ' which tends to 4 as £ — > oo. This is extremely critical when appealing to the 
Bombieri- Vinogradov prime number theorem. On the other hand, the translation k k — 1 does not cause 
any effect as long as k is much larger than £. 
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